



1 3. The method of claim 2, wherein after determining the short-term averaged 

2 energy and the long-term averaged energy, the method further comprises: 

3 determining whether a sum of the short-term averaged energy and a factor is greater 

4 than the long-term averaged energy; and 

5 determining that the current audio frame represents silence if the sum is less than the 

6 long-term averaged energy, without necessitating a determination of the peak-to-mean 

7 likelihood ratio. 

1 4. The method of claim 3, upon determining that the sum is greater than the 

2 long-term averaged energy and before determining the peak-to-mean likelihood ratio, the 

3 method further comprises:

4 determining whether a difference between the long-term averaged energy and the 

5 short-term averaged energy is less than a predetermined threshold; 

6 determining that the current audio frame represents voice if the difference is greater 

7 than the predetermined threshold; and 

8 continuing by determining the peak-to-mean likelihood ratio if the difference is less 

9 than the predetermined threshold. 
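
The energy tests recited in claims 3 and 4 can be sketched as a small decision function. This is a minimal illustration that follows the claim wording literally, not a definitive implementation. The names `Decision` and `energy_gate`, the parameter names, and the handling of exact equality (which the claims leave open) are assumptions made for the example.

```python
from enum import Enum


class Decision(Enum):
    SILENCE = "silence"
    VOICE = "voice"
    NEEDS_PMLR = "needs_pmlr"  # fall through to the peak-to-mean likelihood ratio


def energy_gate(short_term_db, long_term_db, factor_db, diff_threshold_db):
    """Classify a frame from its averaged energies, per the wording of claims 3 and 4.

    All four arguments are hypothetical parameters expressed in decibels.
    """
    # Claim 3: silence if (short-term + factor) is less than the long-term energy.
    if short_term_db + factor_db < long_term_db:
        return Decision.SILENCE

    # Claim 4: compare (long-term - short-term) against a predetermined threshold.
    difference = long_term_db - short_term_db
    if difference > diff_threshold_db:
        return Decision.VOICE

    # Otherwise the likelihood ratio still has to be evaluated.
    # Assumption: equality with the threshold is treated like "less than".
    return Decision.NEEDS_PMLR
```

The silence branch returns before any peak-to-mean computation, which mirrors the "without necessitating a determination of the peak-to-mean likelihood ratio" language of claim 3.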

1 5. The method of claim 2, wherein the determining of the short-term averaged 

2 energy comprises: 

3 determining an energy, in decibels, of the current audio frame; 

4 determining a short-term averaged energy for a prior audio frame; and 
5 conducting a weighted average of the energy of the current audio frame and the short-

6 term averaged energy for the prior audio frame.
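
The weighted average recited in claim 5 can be illustrated as a one-step recursive update. This is a sketch under stated assumptions: the claim does not give the weights or the energy formula, so the mean-square energy definition, the `floor` guard, and the default `weight` of 0.25 are hypothetical choices, as are the function names.

```python
import math


def frame_energy_db(samples, floor=1e-12):
    """Energy of one frame in decibels (assumed: 10*log10 of the mean square)."""
    mean_square = sum(s * s for s in samples) / len(samples)
    # The floor avoids log10(0) for an all-zero frame; its value is an assumption.
    return 10.0 * math.log10(max(mean_square, floor))


def update_short_term(prev_avg_db, frame_db, weight=0.25):
    """Weighted average of the current frame energy and the prior short-term average.

    `weight` is the share given to the current frame; 0.25 is illustrative only.
    """
    return (1.0 - weight) * prev_avg_db + weight * frame_db
```

A long-term average could reuse the same update with a smaller weight, although the claims reproduced here do not say how the long-term value is formed.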





(Twice Amended) A method for enhancing voice activity detection 

comprising:

3 determining a peak-to-mean likelihood ratio, the determining a peak-to-mean

4 likelihood ratio comprises

5 calculating an averaged peak-to-mean ratio for the current audio frame,

6 determining a maximum averaged peak-to-mean ratio, 

7 determining a minimum averaged peak-to-mean ratio, 

8 determining a difference between the maximum averaged peak-to-mean ratio

9 and the averaged peak-to-mean ratio for the current audio frame, 

10 determining a difference between the maximum averaged peak-to-mean ratio 

11 and the minimum averaged peak-to-mean ratio, and

12 conducting a ratio, a denominator of the ratio being the difference between the

13 maximum averaged peak-to-mean ratio and the minimum averaged peak-to-mean

14 ratio, the numerator being the difference between the maximum averaged peak-to-

15 mean ratio and the averaged peak-to-mean ratio; and

16 comparing the peak-to-mean likelihood ratio to a selected threshold to determine 

17 whether a current audio frame represents a voice signal.
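
The ratio recited in this claim can be written out directly: the numerator is the maximum averaged peak-to-mean ratio minus the current averaged value, and the denominator is the maximum minus the minimum. The sketch below is illustrative only. The function names, the zero-span guard, and the default threshold of 0.5 are assumptions; the "greater than" direction of the comparison is taken from claim 18 of this listing.

```python
def pmlr(avg_pmr, max_avg_pmr, min_avg_pmr):
    """Peak-to-mean likelihood ratio: (max - current) / (max - min)."""
    span = max_avg_pmr - min_avg_pmr
    if span <= 0.0:
        # Assumption: with no spread between max and min, report 0.0
        # instead of dividing by zero. The claims do not address this case.
        return 0.0
    return (max_avg_pmr - avg_pmr) / span


def is_voice(avg_pmr, max_avg_pmr, min_avg_pmr, threshold=0.5):
    """Compare the likelihood ratio to a selected threshold (value is hypothetical)."""
    return pmlr(avg_pmr, max_avg_pmr, min_avg_pmr) > threshold
```

When the current averaged value lies between the tracked minimum and maximum, the result falls between 0 and 1, which is consistent with the "normalized" wording used in claims 13 and 18.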

1 8. (Amended) The communication module of claim 12, wherein the voice

2 activity detector, when executed, controls the processing unit to determine whether a sum of

3 the short-term averaged energy and a predetermined factor is greater than the long-term

4 averaged energy, and to signal that the current audio frame represents silence if the sum is

5 less than the long-term averaged energy.



1 9. The communication module of claim 8, wherein the voice activity detector, 

2 when executed, controls the processing unit to determine whether a difference between the 

3 long-term averaged energy and the short-term averaged energy is less than a predetermined 

4 threshold, and to signal that the current audio frame represents voice if the difference is 

5 greater than the predetermined threshold. 

1 10. The communication module of claim 9, wherein the voice activity detector, 

2 when executed, controls the processing unit to determine the peak-to-mean likelihood ratio,

3 and to compare the peak-to-mean likelihood ratio to a selected threshold to determine 

4 whether a current audio frame represents a voice signal. 

1 11. The communication module of claim 10, wherein the voice activity detector,

2 when executed, controls the processing unit to determine a peak-to-mean ratio by (i) 

3 sampling an analog signal a predetermined number of times to produce a plurality of sampled 

4 signals each having a sampled value, (ii) determining a maximum value of the plurality of 

5 sampled signals, and (iii) conducting a ratio between an absolute value of the maximum 

6 value and a summation of the sampled values for the plurality of sampled signals. 
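
Steps (ii) and (iii) of claim 11 can be sketched as follows, reading the claim literally: the absolute value of the maximum sampled value divided by the summation of the sampled values. The function name and the zero-sum guard are assumptions. For zero-mean audio the plain sum can be close to zero, so a caller might reasonably pass rectified (absolute-valued) samples, but that choice is not stated in the claim.

```python
def peak_to_mean_ratio(samples):
    """Ratio of |max sample| to the sum of the samples, per the wording of claim 11."""
    total = sum(samples)
    if total == 0:
        # Assumption: return 0.0 when the summation is zero to avoid division by zero.
        return 0.0
    return abs(max(samples)) / total
```

For example, `peak_to_mean_ratio([1.0, 3.0, 2.0, 2.0])` divides a peak of 3.0 by a sum of 8.0.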

1 12. (Amended) A communication module comprising:

2 a substrate;

3 a processing unit placed on the substrate; and

a memory coupled to the processing unit, the memory to contain a voice activity
detector which, when executed, controls the processing unit to determine an averaged peak-
to-mean ratio for the current audio frame by (i) monitoring a maximum averaged peak-to-
mean ratio and a minimum averaged peak-to-mean ratio, (ii) determining a first result being a
difference between the maximum averaged peak-to-mean ratio and the averaged peak-to-
mean ratio for the current audio frame, (iii) determining a second result being a difference
between the maximum averaged peak-to-mean ratio and the minimum averaged peak-to-
mean ratio, and (iv) conducting a ratio between the first result and the second result to
produce the peak-to-mean likelihood ratio.



13. (Amended) A machine readable medium having embodied thereon a
computer program for processing by a machine, the computer program comprising:

3 a first routine for determining a normalized peak-to-mean likelihood ratio; and

4 a second routine for comparing the peak-to-mean likelihood ratio to a selected

5 threshold to determine whether an audio frame being transmitted represents a voice signal.



1 14. The machine readable medium of claim 13, wherein the computer program 

2 further comprising: 

3 a third routine for determining a short-term averaged energy for the audio frame, the 

4 third routine being executed before the first and second routines; and 

5 a fourth routine for determining a long-term averaged energy for the audio frame, the 

6 fourth routine being executed before the first and second routines. 
1 15. The machine readable medium of claim 14, wherein the computer program

2 further comprising: 

3 a fifth routine for determining whether a sum of the short-term averaged energy and a 

4 predetermined factor is greater than the long-term averaged energy, the fifth routine being 

5 executed before the first and second routines; and 

6 a sixth routine for determining whether a difference between the long-term averaged 

7 energy and the short-term averaged energy is less than a predetermined threshold, the sixth 

8 routine being executed after determining that the sum is greater than the long-term averaged 

9 energy and before execution of the first and second routines. 

1 16. The machine readable medium of claim 15, wherein the fifth routine 

2 determining that the current audio frame represents silence if the sum is less than the long- 

3 term averaged energy. 

1 17. The machine readable medium of claim 15, wherein the sixth routine 

2 determining that the current audio frame represents voice if the difference is greater than the 

3 predetermined threshold. 




1 18. (Amended) A voice activity detector comprising:

2 circuitry to determine a short-term averaged energy for an audio frame;

3 circuitry to determine a long-term averaged energy for the audio frame;

4 circuitry to determine whether the short-term averaged energy is greater than the

5 long-term averaged energy by a predetermined factor;

6 circuitry to determine whether a difference between the long-term averaged energy

7 and the short-term averaged energy is less than a predetermined threshold when the short-

8 term averaged energy is greater than the long-term averaged energy by the predetermined

9 factor;

10 circuitry to determine a normalized peak-to-mean likelihood ratio when the difference

11 between the long-term averaged energy and the short-term averaged energy is less than the

12 predetermined threshold; and

13 circuitry to compare the peak-to-mean likelihood ratio to a selected threshold and to

14 determine that the audio frame represents a voice signal when the peak-to-mean likelihood

15 ratio is greater than a selected threshold.







(New) A method for enhancing voice activity detection comprising:
determining a peak-to-mean likelihood ratio including (i) a denominator having a
value substantially equal to a difference between a maximum averaged peak-to-mean ratio
and a minimum averaged peak-to-mean ratio and (ii) a numerator having a value
substantially equal to a difference between the maximum averaged peak-to-mean ratio and
the averaged peak-to-mean ratio; and

comparing the peak-to-mean likelihood ratio to a selected threshold to determine
whether a current audio frame represents a voice signal.



1 21. (New) The method of claim 20, wherein prior to determining the peak-to-

2 mean likelihood ratio, the method further comprises:

3 determining a short-term averaged energy for the current audio frame; and

4 determining a long-term averaged energy for the current audio frame. 
(New) The method of claim 21, wherein after determining the short-term
averaged energy and the long-term averaged energy, the method further comprises:

determining whether a sum of the short-term averaged energy and a factor is greater
than the long-term averaged energy; and

determining that the current audio frame represents silence if the sum is less than the
long-term averaged energy, without necessitating a determination of the peak-to-mean
likelihood ratio.

23. (New) The method of claim 22, upon determining that the sum is greater than
the long-term averaged energy and before determining the peak-to-mean likelihood ratio, the
method further comprises: 

determining whether a difference between the long-term averaged energy and the 
short-term averaged energy is less than a predetermined threshold; 

determining that the current audio frame represents voice if the difference is greater
than the predetermined threshold; and 

continuing by determining the peak-to-mean likelihood ratio if the difference is less 
than the predetermined threshold. 



24. (New) The method of claim 21, wherein the determining of the short-term
averaged energy comprises:

determining an energy, in decibels, of the current audio frame;
determining a short-term averaged energy for a prior audio frame; and
conducting a weighted average of the energy of the current audio frame and the short-
term averaged energy for the prior audio frame.